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Methods for Genetic Diversification 
in Gene Conversion Active Cells 

The present invention relates to a method for directed and selective genetic diversification of a 
target nucleic acid sequence or gene product by exploiting the relationship betwreen 
immunoglobulin gene conversion and hypermutation in antibody-producing cells, as well as to 
cells and cell lines capable of said genetic diversification. 

Many approaches to the generation of diversity in gene products rely on the generation of a veiy 
large number of mutants which are then selected using powerful selection technologies. However, 
these systems have a number of disadvantages. If the mutagenesis is done in vitro on gene 
constructs which are subsequently expressed in vitro or as transgenes in cells or animals, the gene 
expression in the physiological context is difficult and the mutant repertoire is fixed in time. If 
mutagenesis is on the other hand perfonned in living cells, it is difiScult to direct mutations to a 
target nucleic acid where they are desired. Therefore the efficiency of isolating molecules with 
improved activity by repeated cycles of mutations and selection with sufficient efficiency is 
limited. Moreover, random mutagenesis in vivo is toxic and likely to induce a high level of 
undesirable secondary mutations. 
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In nature, directed diversification of a selected nucleic acid sequeace takes place in the rearranged 
V(D)J segments of the immunoglobulin (Ig) gene loci. The primary repertoire of antibody 
specificities is generated by a process of DNA reairang^ent involving the joining of 
immunoglobulin V, D, and J gene segments. Following antigen encounter, the rearranged V(D) J 
segments in those B cells, whose surface Ig can bind the antigen with low or moderate afSnity, are 
subjected to a second wave of diversification by hypennutation. Thds so-called somatic 
hypermutation generates the secondary repertoire from which increased binding specificities are 
selected thereby allowing affinity maturation of the humoral immune response (Milstein and Rada, 
1995). 

The mouse and man inomunoglobulin loci contain large pools of V, D and J grae segm^ts which 
can p«uticipate in the V(D)J rearrangement, so that significant diversity is created at this stage by 
random combination. Other species such as chicken, rabbit, cow, skeep and pig employ a different 
strategy to develop their primary Ig repertoire (Butler, 1998). After the rearrangement of a single 
functional V and J segment, further diversification of the chicken light chain gene occurs by gene 
conversion in a specialized lymphoid oigan, the Bursa of Faibricius CReynaud et al, 1987; 
Arakawa and Bueistedde, in press). During this process, stretches of sequences from non- 
functional pseudo-V-genes are transferred into the rearranged V-geae. The twenty-five pseudo-V- 
genes are situated upstream of the fimctional V-gene and share sequence homology with the V- 
gene. Similar to the situation in men and mice, aSinity maturation after antigeii encounter takes 
place by hypermutation in the splenic germinal centers of the chicken (Arakawa et al., 1996), 

All three B cell specific activities of Ig repertoire formation - gene conversion (Arakawa et al, 

2002), hypermutation and isotype switch recombination (Muramatsu et al., 2000; Revy et aL, 
2000) - require expression of the Activation Induced Deaminase (AID) gene. Whereas it was 
initially proposed that AID is a DNA editing enzyme (Muramatsu et al., 1999), more recent studies 
indicate that AID directly modifies DNA by deamination of cytosine to uracil (Di Noia and 
Neuberger, 2002). However, the cytosine deammation activity must be further regulated, because 
only differences in the type, the location and the processing of the ADD-induced DNA 
modification can explain the selective occurrence of recombination or hypennutation in different 
species and B cell environments. Based on the finding that certain AED mutations affect switch 
recombination, but not somatic hypermutation, it was suggested that AID needs the binding of a 
co-factor to start switch recombination (Ta et al., 2003; Barreto et al, 2003). 

Analysis of DT40 knock-out mutants indicates that the RAD54 gene (Bezzubova et al., 1997) and 
other members of fiie RADS2 recombination repair pathway are needed for efficirat Ig gene 
conversion (Sale et al., 2001). Disruption of RAD51 analogues and paxalogues reduces Ig gene 
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conversion and induces hypennutation in the rearranged light chain gene (Sale et al., 2001) 
suggesting that a defect in DNA repair by homologous recombination can shift Ig gene conversion 
to hypennutation. 

Recently, first cell systems have been developed which exploit the phenomenon of somatic 
hypennutation in the immunoglobulin locus to generate mutants of a target gene in constitutive 
and directed manner. These cell systems allow to prepare a gene product having a desired activity 
by cyclical steps of mutation generation and selection. Thus, WO 00/221 1 1 and WO 02/100998 
describe a hmnan Burkitt lymphoma cell line (Ramos) which is capable of directed constitutive 
hypennutation of a specific nucleic acid region. This mutated region can be the endogenous 
rearranged V segment or an exogenous gene operatively linked to control sequences which direct 
hypennutation. A significant disadvantage of this cell system is that hximan cells cannot be 
efficiently genetically manipulated by targeted integration, since tiansfected constructs insert 
primarily at random chromosomal positions. 

WO 02/100998 also describes another cell system for generating genetic diversity in the Ig locus 
which is based on &e chicken B cell line DT40. DT40 continues gene conversion of the 
rearranged light chain inununoglobulin gene during cell culture (Buerstedde et al., 1990). 
•Importantly, this cell line has a high ratio of targeted to random integration of transfected 
constructs thus allowing efficient genetic manipulation (Buerstedde and Takeda, 1991). According 
to WO 02/100998, deletion in DT40 of the paralogues of the RAD5 1 gene which are involved in 
homologous recombination and DNA repair led to a decrease in g^e conversion and a 
simultaneous activation of hypennutation of the rearranged V segment. However, the main 
disadvantage of this system is that the mutant cells have a DNA repair deficiency as reflected by 
X-ray sensitivity and chromosomal instability. The mutants also have a low proliferation rate and a 
low gene targeting efficiency. ITierefore this system is poorly suited for efficient gene 
diversification and selection. 

The present invention overcomes the disadvantages of the prior art systems and provides further 
advantages as well. 

SUMMARY OF THE INVENTION 

In the first aspect of the invention there is provided a genetically modified lymphoid cell having 
gene conversion fully or partially replaced by hypennutation, wherein said cell has no deleterious 
mutations in genes encoding paralogues and analogues of the RAD51 gene which encode 
important homologous recombination factors. Specifically, the cell contains wild-type homologous 
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recombination factors. Due to the intact homologous recombination machinery, the cell according 
to the invention is recombination and repair proficient and has a normal proliferation rate. 

The ceU of the invention is an immunoglobulin-expressing B lymphocyte derived &am animal 
5 species which use the mechanism of gene conversion for developing their immunoglobulin 

repertoire. These species are for example chicken, sheep, cow, pig and rabbit Preferably, the cell 
is derived from a chicken Bursal lymphoma. Most preferably, the cell is derived from or related to 
theDT40 cell line. 

10 In a further embodiment, the cell according to the invention is capable of directed and selective 
genetic diversification of a target nucleic acid by hypermutation or a combination of 
hypermutation and gene conversion. The target nucleic acid may encode a protein or possess a 
regulatory activity. Examples of proteins are an immunoglobulin chain, a selection marker, a 
DNA-binding protein, an enzyme, a receptor protein or a part thereof. In a preferred embodiment, 

15 the target nucleic acid is the V(D)J segment of a rearranged human immunoglobulin gene. 

Examples of regulatory nucleic acids are a transcrQ)tion regulatory element or a RNAi sequence. 

In an embodiment, in which the target nucleic acid is diversified by a combination of 
hypermutation and gene conversion, the cell according to the invention contains at least one 
20 sequence capable of serving as a gene conversion donor for the target nucleic acid. 

In a further embodiment, the target nucleic acid is an exogenous nucleic acid operably linked to 
control nucleic acid sequences that direct goietic diversification. 

25 In an additional embodiment, the target nucleic acid is expressed in the cell according to the 
invention in a marmer diat facilitates selection of cells which exhibit a desired activity. The 
selection can be a direct selection for the activity of the target nucleic acid within ihe cell, on the 
cell sur^e or outside the cell. Alternatively, the selection can be an indirect selection for the 
activity of a reporter nucleic acid. 

30 

In a further embodiment, the invention provides for genetic means to modulate the genetic 
diversification of the target nucleic acid in the cell according to the invention. The modulation can 
be by modification of cis-acting regulatory sequences, by varying the number of gene conversion 
donors, or by modification of trans-acting regulatory factors such as activation-induced deaminase 
35 (AID) or a DMA repair or recombination &ctor other than a RAD5 1 analogue or paralogue. The 
cell preferably expresses activation-induced deaminase (AID) conditionally. 
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In a second aspect, there is provided a cell line derived from a cell according to the invention. la a 
preferred embodiment, the ceil line is DT40 or a modification thereof. 

In a third aspect, there is provided a transgenic non-human animal containing a lymphoid cell 
5 having gene conversion fiilly or partially replaced by hyp«mutation, wherein said cell has no 
deleterious mutations in genes encoding paralogues and analogues of the RAD5I protein, and 
wherein said cell is capable of directed and selective genetic diversification of a transgenic target 
nucleic acid by hypermutation or a combination of hypermutation and gene conversion. In a 
preferred embodiment, the animal is chicken. 

10 

In a further aspect, the invention provides a method for preparing a cell cj^able of directed and 
selective genetic diversification of a target nucleic acid by hypennutation or a combination of 
hypermutation and gene conversion. The method comprises (a) transfecting a lymphoid cell 
capable of gene conversion with a genetic construct containing the target nucleic acid, and (b) 
* 1 5 identifying a cell having the endogenous V-gene segment of a part thereof replaced with the target 
nucleic acid. 

According to a fiirther embodiment, the genetic construct containing the target nucleic acid further 
contains at least one nucleic acid capable of serving as a gene conversion donor for the target 
20 nucleic acid. The locus containing the target nucleic acid can be constmcted by a single 

transfection or multiple rounds of transfection with constructs containing different components of 
the locus. 

In the embodiment, in which selection for a cell with a desired activity is indirect, the method of 
25 the inv^tion further comprises (c) transfecting the cell from step (b) with a further genetic 
construct comprising a reporter gene capable of being influenced by the target nucleic acid. 

In a fiirther embodiment, the method of the invention fiarther comprises (d) conditional expressioa 
of a trans-acting regulatory factor. In a preferred embodiment, the trans-acting regulatory factor is 
30 activation-indaced deaminase (ADD). 

According to a particularly preferred embodiment, the target nucleic acid is inserted into the cell 
by targeted integration. 

35 In a fiirther aspect, there is provided a method for preparing a gene product having a desired 
activity, comprising the steps of: (a) culturing cells according to the invention under appropriate 
conditions to express the target nucleic acid, (b) identifying a cell or cells within fhe population of 
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cells which expresses a mutated gene product having the desired activity; and (c) establishing one 
or more clonal populations of cells from the cell or cells identified in step (b), and selecting from 
said clonal populations a cell or cells which expresses a gene product having an improved desired 
activity. 

5 

In one embodiment, steps (b) and (c) are iteiatively repeated until a gene product with an 
optimized desired activity is produced. 

According to a fiirtiier embodiment, the genetic diversification can be switched off, for example, 
10 by down-regulation of the expression of a trans-acting regulatory factor, when the cell producing a 
gene product with an optimized desired activity has been identified. Hie trans-acting regulatory 
factor can be, for example, activation-induced deaminase (AID) or a factor involved in 
homologous recombination or DNA repair, other than a RAD51 paralogue or analogue. 

15 In another embodiment, the diversification of the target nucleic acid is further modified by target 
sequence optimization such as the introduction of Ig hypermutation hotspots or an increased GC 
content. 

In a fiirther aspect of the present invention, there is provided the use of a cell capable of directed 
20 and selective genetic diversification of a target nucleic acid by hypermutation or a combination of 
hypermutation and gene conversion for the preparation of a gene product having a desired activity. 

DESCRIPTION OF THE FIGURES 

25 Fig. 1 H^V gene deletion (A) A physical map of ^e chicken rearranged Ig light chain locus 

and the xjfV knock-out constructs. The locus contains a total of 25 x|/V genes upstream of 
functional V segment. The kaock-out strategy of \|/V genes by the targeted integration of die 
p\}/VDell-25 and the p\[/VDel3-25 constructs is shown below. Only the relGvant EcoRI sites &re 
indicated. (B) Southern blot analysis of wild-type and knock-out clones using the probe shown in 

30 (A) after EcoRl digestion. The wild-type locus hybridizes as a 12-kb fi:agment, whraeas 

yyparti*! and \|/\rioci hybridize as a 7.4-kb and 6.3-kb fifBgment, respectively. (Q AID status. The' 
AID gene was amplified by PGR to verify the presence or absence of AID cDNA expression 
cassette. 

35 Fig. 2 sIgM expression analysis of control and knock-out clones (A) FACS anti-IgM 

staining profiles of representative subclones derived from initially 8lgM(+) clones. (B) Average 
percentages of events falling into sIgM(-) gates based on the measurement of 24 subclones. 
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Fig. 3 Ig light chain sequence analysis of the xj/V knock-out clones Mutation profdes of 
the AID and AJD\V^^ clones. All nucleotide substitutions identified in different 
sequences in the region from the leader sequraice to &e J-C intron are mapped onto the rearranged 
5 light chain sequence present in the AID^precursor clone. Mutations of the AIDV V and the 
^jIjR^y partial ^i^j^^ ^ sho wn above and below the reference sequence, respectively. Deletions, 
insertions and gene conversion events are also indicated. Hotspot motLd^ (RGYW and its 
complement WRCY) are highlighted by bold letters. 

10 Fig. 4 Mutation profiles of hypermutating cell lines (A) Perc& Jitages of sequences 

carrying a certain numb^ of mutations. Each imtemplated nucleotide substitution is counted, but 
gene conversion, deletions and insertions involving multiple nucleotides are counted as a single 
event. PM, point mutation; GC, gene conversion; D, deletion; I, insertion. (B) Pattern of nucleotide 
substitutions within sequences from \|/V and the XRCC3 knock-out clones. Nucleotide 

15 substitutions as part of gene conversion events are excluded. The ratios of transition (trs) to 

transversion (trv) are also shown, (C) Hotspot preference of untemplated nucleotide substitution 
mutations. Mutations occurring within a hotspot motif (RGYW or its complement WRCY) are 
shown by percentages. (D) Trypan-blue positive cells as an indicator of spontaneously dying cells. 

20 Fig. 5 Distribution of nucleotide substitutions within genomic sequences from nnsbrted 

AID VV cells and within cDNA sequences from sorted IgM (-) AIO^V cells The number of 
mutations are counted for every 50 bp, and are shown together with the corresponding physical 
maps of the light chain genomic locus or the cDNA sequence. 

25 Fig. 6 A model explaining the regulation of Ig gene conversioi& and Ig hypermutation 

Fig. 7 In sitti mutagenesis of the GFP gene (A) Ig VJ replacement vector. (B) in vivo 
mutagenesis of the GFP gene by hypennutation. (C) \|/V donor replacenaent vector. (D) in vivo 
mutagenesis of GFP gene by gene conversion and hypermutation. 

•30 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention makes available a particularly usefiil cell system for directed and selective 
genetic diversification of any nucleic acid by hypermutation or a combicaation of hypermutation 
35 and gene conversion. The system is based on B cell lines which constitatively diversift^ the 
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rearranged immunoglobxilin V-gene in vitro without requiring extracellular stimuli such as an 
interaction with other cells or molecules or maintenance of the B cell antigen receptor. 

As used herein, "directed and selective div^ification" refers to the ability of certain cells to cause 
S alteration of the nucleic acid sequence of a specific region of Oogenous or transgenic nucleic 
acid, whereby sequences outside of these regions are not subjected to mutation. 

"Genetic diversification" refers to alteration of individual nucleotides or stretches of nucleotides in 
a nucleic acid. Genetic diversification in the cells according to the invention occiurs by 
10 hypermutation, gene conversion or a combination of hypennutation and gene conversion. 

"Hypermutation" refers to the mutation of a nucleic acid in a cell at a rate above background. 
Preferably, hypennutation refers to a rate of mutation of between 10'^ and 10"^ bp"* generation \ 
This is greatly in excess of background mutation rates, which are of order of 10'^ to 10'*^ 
15 mutations bp*^ generation * (Drake et al. 1988) and of spontaneous mutations observed in PCR. 
Thirty cycles of amplification with Pfii polymerase would produce <0.05xl0*^ mutations bp*^ in 
the product, which in the present case would account for less than 1 in ICQ of fiie observed ■ 
mutations O^undberg et aL, 1991). 

20 "Gene conversion" refers to a phenomenon in which sequence information is transferred in 

unidirectional manner from one homologous allele to the other. Gene conversion may be the result 
of a DNA polymerase switching templates and copying fiom a homologous sequence, or the result 
of mismatch repair (nucleotides being removed from one strand and replaced by repair synthesis 
using the other strand) after the formation of a heteroduplex. 

25 

Hypennutation and gene conversion generate natural diversity within the immunoglobulin V(D) J 
segment of B cells, Hypennutation takes place in the germinal centers of such species as mouse 
and human following antigen stimulation. Goie conversion takes place in primary lymphoid 
organs like the Bursa of Fabricius or the gut-associated lymphoid tissue in such species as chicken, 
30 cow, rabbit, sheep and pig independent of antigen stimulation. In chick^ stretches from the 
upstream pseudo-V-genes are transfened into the rearranged V(D) J segment According to the 
present invention, therefore, the cell or cell line is preferably an immunoglobuhn-producing cell or 
cell line which is capable of diversifying its rean^ged immunoglobulin genes. 

35 A direct connection between the initiation of hypennutation and gene conversion is for the first 
time established in the experiments reported hereiiL Specifically, partial or complete deletion of 
pseudo-V-genes in a cell line which continues gene conversion in cell culture leads to the 
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activation of hypennutation in the immunoglobulin locus. Deletion of all pseudogenes results in 
the abolishment of gene conversion and simultaneous activation of high rates of hypermutation, 
whereas deletion of a few pseudogenes results in the down-regulation of gene conversion and 
simultaneous activation of hypennutation at rates lower than the ones observed for the complete 
5 pseudogene deletion. Therefore, Ihe number of available pseudogene donors directly correlates 
with gene conversion rates and inversely correlates with hypermutation rates. Gene conversion and 
hypermutation are established to be in a reciprocal relationship to each other. Thus, the present 
invention for the first time provides a cell system which allows to genetically diversify a target 
nucleic acid by a combination of hypermutation and gene conversion, whereby the contribution of 
1 0 these two phenomena can be regulated by changing the number of flie g^e conversion donors, 
their orientation or their degree or length of homology. 

An advantage of the cell system according to the invention over a cell system with only 
. hypermutating activity such as the one b^sed on the human Burkitt lymphoma cell line Ramos 

15 (WO00/22I11 and WO 02/100998) is the abiUty to combine genetic diversification by 
hypennutation and gene conversion in one cell. For exam|)le, more defined changes can be 
introduced into the target gene by gene conversion than by random hypermutatioii, since gene 
conversion donors can be engineered to contain sequences likely to influence the target nucleic 
acid activity in a favorable way. Gene conversion and hypennutation might thus increase the 

20 chance to produce desirable variants, since pre-tested sequence blocks are combined with random 
hypermutations. Pseudogenes with sequences identical to a certain region of the target gene can 
also be used to keep a part of the target nucleic acid stable by firequent conversions having the 
effect that the hypermutations persist only in the non-converting part This approach is useful 
when the target nucleic acid contains region which should remain stable for optimal activity. 

25 

An advantage of the cell system according to the invention over a cell system based on the 
suppression of homologous recombination activity in gene conversion active cells (WO 
02/100998) is genetic stability of the cell reflected in a normal proliferation rate, radiation 
resistance and DNA repair competence. 

30 

A particular advantage of the present cell system over all known systems is the ability of the cells 
according to the invention to integrate transfected nucleic acid constructs by targeted integration 
into the homobgous endogenous locus. 

3 5 'Targeted integration" is integration of a transfected nucleic acid construct comprising a nucleic 
acid sequence homologous to an endogenous nucleic acid sequence by homologous recombination 
into the endogenous locus. Targeted integration allows to directly insert any nucleic acid into a 
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defined chromosomal position. In a preferred embodiment, a nucleic acid encoding a gene product 
of interest is inserted by targeted integration into the immunoglobulin locus in place of die 
. reairanged V(D) J segment or a portion thereof. 

5 In a preferred embodiment, fhe cells according to the invention are derived or related to cdls 

which undergo Ig gene conversion in vivo. Cells which undergo Ig gene conversion in vivo are, for" 
example, surface Ig expressing B ceUs in primary lymphoid organs such as avian Bursal B cells. 
Lymphoma ceils, derived from B cells of primary lymphoid organs, are particularly good 
candidates for constructing cells and cell lines according to fhe present invention. In fhe most 
10 preferred embodiment, the cells are derived from a chicken Bursal lymphoma cell line DT40. 

The process of constitutive genetic diversification by hypermutation and gene conversion is used 
in the present invention to produce gene products with a desired, novel or improved, activity. 

15 A "target nucleic acid" is a nucleic acid sequence or chromosomal region in the cell according to 
the present invention which is subjected to direct and selective genetic diversification. Hie target 
nucleic acid can be either endogenous or transgenic and may com|)rise one or more transcription 
units encoding gene products. 

20 As used herein, a "transgene" is a nucleic acid molecule which is inserted into a cell, such as by 
transfection or transduction. For example, a transgene may comprise a heterologous transcription 
unit which may be inserted into the genome of a cell at a desired location. 

In one embodiment, transgenes are immunoglobulin V-genes as found in immunoglobulin- 
25 producing cells or fragments of V-genes. Preferably, the target nucleic acid is a human 

immunoglobulin V-gene. In this case, the cells according to the invention are "factories" of human 
antibody variants capable of binding to any given antigen. 

Altematively, the target nucleic acid is a non-immunoglobulin nucleic acid, for example a gene 
30 encoding selection markers, DNA-binding proteins, enzymes or receptor proteins. For example, a 
novel fluorescent selection marker can be produced by mutating a known marker by 
hypermutation or by a combination of hypermutation and gene conversion with help of other 
known markers widi a dijfferent fluorescent spectrum serving as gene conversion donors. 

35 In one embodiment of the invention, the target nucleic acid directly encodes a gene product of 
interest Gene diversification of such a nucleic acid will result in a truncation of the encoded gene 
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product or in a change of its primary sequence. With every round of diversification and selection, a 
ceD expressing the gene product with an improved activity is search for. 

Alternatively, the target nucleic acid is a regulatory element, for example, a tianscrq>tion 
5 regulatory element such as promoter or enhancer, or interfering UNA (RNAi). In this embodiment, 
an additional nucleic acid (reporter gene) which is influenced by the target nucleic acid and 
encodes an identifiable gene product is required to identify cells bearing the target nucleic acid of 
interest. 

10 In the embodiment, in which genetic diversification of the target nucleic acid takes place by a 
combination of hypiennutation and gene conversion, additional nucleic acids capable of serving as 
gene conversion donors are inserted into the cell genome, preferably upstream of the target nucleic 
acid. 

15 A "nucleic acids capable of serving as a gene conversion donor** is a nucleic acid having a 

sequence homologous to the target nucleic acid. Examples of natural gene conversion donors are 
pseudo-V-genes in the inmiunoglobulin locus of certain species. 

According to one embodiment of the invention, a cell capable of directed and selective 
20 diversification of the target nucleic acid is constructed by inserting the target nucleic acid into the 
host cell by targeted integration at a defined chromosomal site. For this purpose, the transfected 
constructs may contain upstream and downstream of the target nucleic acid sequences homologous 
to the desired chromosomal integration site. Preferably, the cell is constructed by rq)lacing the 
endogenous V-gene or segments thereof with a transgene by homologous recombination, or by 
25 gene targeting, such that the transgene becomes a target for the gene conversion and/or 
hypennutation events. 

In another embodiment, transgenes according to the invention also comprise sequences which 
direct hypennutation and/or goie conversion. Thus, an cntko locus capable of K^ressing a gene 
30 product and directing hypennutation and gene conversion to this transcription unit is transfen-ed 
into the cells and is actively diversified even after random chromosomal integration. 

Screening of clones having incorporated the transgene by targeted integration can be done by 
Southern blot analysis or by PGR. 



35 
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In a preferred embodiment, transgenes according to the invention contain a selectable marker gene 
which aUows selection of clones which have stablely integrated the transgene. This selectable 
maricer gene may subsequently be removed by recombination or inactivated by other means. 

5 The present invention further provides a method for preparing a gene product having a desired 
activity by repeated rounds of cell expansion and selection for cells bearing a target nucleic acid 
vwth a desired activity. As used herein, "selection" refers to the determination of the presence of 
sequence alterations in the target nucleic acid which result in a desired activity of the gene product 
encoded by the target nucleic acid or in a desired activity of the regulatory function of the target 
10 nucleic acid. 

The process of gene conversion and hypermutation is employed in vivo to generate improved or 
novel binding specificities in immunoglobulin molecules. Thus, by selecting cells according to the 
invention which produce immunoglobulins capable of binding to the desired antigen and then 
propagating these cells iii order to allow the generation of further mutants, cells which express 
immunoglobulins having improved binding to the desired antigen may be isolated. 

A variety of selection procedures may be applied for the isolation of mutants having a desired 
specificity. These include Fluorescence Activated Cell Sorting (FACS), cell separation using 
magnetic particles, antigen chromatography methods and other cell separation techniques such as 
use of polystyrene beads. 

Fluorescence Activated Cell Sorting (FACS) can be used to isolate cells on the basis of their 
differing surface molecules, for example suifece displayed immunoglobulins. Cells in the sample 
or population to be sorted are stained with specific fluorescent reagents which bind to the cell 
surface molecules. These reagents would be the antigen(s) of interest linked (either directly or 
indirectly) to fluorescent markers such as fluorescein, Texas Red, malachite green, green 
fluorescent protein (GFP), or any other fluorophoie known to those skilled in the art The cell 
population is then introduced into the vibrating flow chamber of the FACS machine. The cell 
stream passing out of the chamber is encased in a sheath of buffer fluid such as PBS (Phosphate 
Buffered Saline). The stream is illuminated by laser light and each cell is measured for 
fluorescence, indicating binding of the fluorescent labeled antigen. The vibration in the cell stream 
causes it to break up into droplets, which cany a small electrical charge. These droplets can be 
steered by electric deflection plates under computer control to collect different cell populations 
according to their affinity for the fluorescent labeled antigen. In this manner, cell populations 
which exhibit different affinities for the antigen(s) of interest can be easily separated from those 
cells which do not bind the antigen. FACS machines and reagents for use in FACS are widely 
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available from sources world-wide such as Becton-Diddnson, or from service providers such as 
Arizona Research Laboratories (http:/Avww.arl.ari20na,edu/facs/). 

Another method which can be used to separate populations of cells according to ttie affinity of 
5 their cell surface protein(s) for a particular antigen is affinity chromatography. In this method, a 
suitable resin (for example CL-600 Sepharose, Pharmacia Inc.) is covalently linked to the 
appropriate antigen. This resin is packed into a colunm, and the mixed population of cells is passed 
over the column. After a suitable period of incubation (for example 20 minutes), unboimd cells are 
washed away using (for example) PBS buffer. This leaves only that subset of cells expressing 

1 0 inununoglobulins which bound the antigen(s) of interest, and these cells arc then eluted from the 
column using (jbr example) an excess of the antigen of interest, or by enrymatically or chemically 
cleaving the antigen from the resin. This may be done usmg a specific protease such as factor X, 
thrombin, or other specific protease known to those skilled in the art to cleave the antigen from the 
column via an appropriate cleavage site which has previously been incoiporated into the antigen- 

15 resin complex. Alternatively, a non-specific protease, for example trypsin, may be employed to 
remove the antigra from the resm, thereby releasing fliat population of cells which exhibited 
affinity for the antigen of interest. 

The present invention provided for the first time a mechanism which allows to regulate genetic 
20 diversification of the target nucleic acid. As demonstrated by the present inventors, activation- 
induced deaminase (AID) is a factor which regulates gene conversion as well as hypermutation in 
the unmunoglobulin locus. It is suggested that AID induces a common modification in the 
rearranged V(D) J segment leading to a conversion tract in the presence of adjacent donor 
sequences aiid to a point mutation m tiieir absence. Therefore, by regulation of AID expression, 
25 botii phenomena can be modulated. In a prefeired embodiment, the AID gene is transiently 

expressed in the cell containing a target nucleic acid. For example, ADD can be expressed under a 
drug-responsive promoter such as the tetracycline responsible gene expression system. Otherwise 
the gene expression may be shut down by flie excision of the AID expression cassette by induced 
recombination. Switching off the AID expression will prevent further diversification of flie target 
30 sequence. Preferably, AID expression is switched off in the cell producing a gene product with a 
desired activity in order to prevent further mutations which can lead to the loss of the desired 
activity. 

The uivention is illustrated by the following examples. 

35 
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EXAMPLES 

1. AID initiates immunoglobulin gene conversion and hypermutation by a 
common intermediate 

5 

Herein it is reported that ablation of donors activates AID-dependent Ig hypermutation in 
chicken B cell line DT40. This shows that Ig gene conversion and hypermutation aie competing 
■ pathways derived from the same AID-initiated intermediate. Furthennore \|/V knock-out DT40 is 
proposed as an ideal model system to approach the molecular mechanism of Ig hypermutation and 
10 as a new tool for in situ mutagenesis. 

Methods 

Cell lines. DT40^^^ which displays increased Ig gene conversion due to a v-myb transgene 
15 and contains a tamoxifen inducible Cre recombinase has been described previously (Arakawa et 
aL. 2001). DT40^*AID-'" was generated by the targeted disruption of both AID alleles of DT40^' 
(Arakawa et al, 2002). AID*^ was derived from DT40^^AID'^ after stable integration of a floxed 
AID-IRES-GFP bicistronic cassette, in which both AID and GFP are expressed from the same p- 
actin promoter. MDVV" was derived from AID^by transfection of p\j/VDell-25 (Fig. 1 A). Stable 
20 transfectants which had integrated the construct into the rearranged light chain locus were then 

identified by locus specific PGR. Targeted integration of pvVDell-25 results in the deletion of the 
entire xj/V gene loci starting 0.4 kb upstream of \\fV25 and ending one bp downstream of x^Vl. 
AID VV**^°* was produced in a similar way as AIDVV" by transfection of p\|/VDeB-25 which 
upon targeted integration leads to a partial deletion of the nrV loci starting 0.4 kb upstream of 
25 y V2S and ending one bp downstream of \)/V3. Cell culture and electroporation were performed as 
previously described (Arakawa et al., 2002). XRCC3^" was derived from DT40^* by deleting 
amino acids 72-170 of XRCC3 gene following transfection of XRCC3 knock-out constructs. 
Clones having undergone targeted integration were initially identified by long-range PCR and the 
XRCC3 deletion was then confirmed by Southern blot analysis. 

30 

Ig reversion assay. Subcloning, antibody staining, flow cytometiy and quantification of 
sIgM expression has heea described previously (Arakawa et aL, 2002). All clones used in the 
study were sIgM(+) due to the repair of the light chain fi^cshift of the original C118(-) variant 
(Buerstedde et al, 1990) by a gene conversion event 
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PCR. To minimize PCR-introduced artificial mutations, PfuUltra hotstart polymerase 
(Stratagene) was used for amplification prior to sequencing. Long-range PCR, RT-PCR and Ig 
light chain sequencing were peifonned as previously described (Arakawa et al., 2002). The 
promoter and J-C intron region of Ig light ch^m plasmid clones were sequenced using the M13 
5 forward and reverse primers. Bu-I and EFla genes were amplified using BU1/BU2 (BUI, 
GGGAAGCTTGATCATITCCTGAATGCTATATTCA; BU2. 
GGGTCTAGAAACTCCTAGGGGAAACTTTGCTGAG) andEF6/EF8 (EF6, 
GGGAAGCTTCGGAAGAAAGAAGCTAAAGACCATC; EF8, 

GGGGCTAGCAGAAGAGCGTGCTCACGGGTCTGCC) primer pairs, respectively. The PCR 
10 products of these genes were cloned into the pBluescript plasmid vector^ and were sequenced 
using the M13 reverse primer. 

Results 



IS Targeted deletion of yV donor sequences in the rearranged light chain locus 

Two \|/V knock-out constructs were made by cloning genomic sequences, which flank the 
intended deletion end points, upstream and downstream of a floxed-gpt (guanine phosphoribosyl 
transferase) cassette (Arakawa et al., 2001). Upon targeted integration, the first construct, 
p\|;VDell-25, deletes all pseudogenes (\i/V25 to \|/V1) whereas the second construct, p\|/VDel3-25, 

20 deletes most pseudogenes (x^VlS to H/V3) (Fig. 1 A). A surface IgM positive (sIgM(+)) clone, 

derived fix)m DT40^* AID''" cells (Arakawa et al., 2002) by transfection and stable mtegration of a 
floxed AID-IRBS (internal ribosome entry site) -GFP transgene, was chosen for the transfection of 
the \j/V knock-out constructs. This AID reconstituted clone, named AID^ has the advantage that 
the appearance of deleterious Ig light chain mutations can be easily detected by the loss of sIgM 

25 expression and that GFP-marked AID expression can be shut down after tamoxifen induction of 
the Cre recombinase transgene inherited fi-om DT40^' (Arakawa et al., 2002). 

Followmg transfection of the yV knock-out constructs into the AID*^ clone, mycophenolic 
acid resistant clones containing targeted deletions of the rearranged light chain locus were 
identified. These primary \|/V knock-out clones contain two floxed transgenes, the inserted gpt 

30 marker gene in the rearranged light chain locus and the AID-IRES-GFP gene of the AED^ 
progenitor clone. Since the gpt gene might perturb the adjacent transcription or chromatin 
configuration, the primary \\fV knock-outs were exposed to a low concentratibn of tamoxifen and 
then subcloned by limited dilution. In this way, secondary \|fV knock-out clones could be isolated 
which had either deleted only the gpt gene (AID VV" and ASD\V^ or the gpt gene together 

35 with the AID-IRES-GFP gene (AID''>V" and AID^V^. The disraption of \f/ genes in the 
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reairanged light chain locus and the excision of AID over-expression cassette were confinned by 
Southern blot analysis (Fig. IB) and PGR (Fig. IC), respectively. 

Increased loss of sIgM expression after deletion of yV genes in AID positive clones 

5 To estimate the rates of deleterious Ig mutations, sig expression was measured by FACS 

after two weeks culture for 24 subclones each of the DT40^^ AID^, DT40^*AID^"and 
knock-out clones (Figs. 2A and 2B). Analysis of the controls with the intact \|fV locus revealed an 
average of 0.52% and 2.27% sIgM(-) cells for the DT40^' and AID^ subclones respectively, but 
only 0.08% for the DT40^^AID-'-. Previous analysis of spontaneously arising sIgM(-) DT40 

10 variants demonslrated that about a third contained firameshifl mutations in the reairanged light 
chain V segment which were regarded as byproducts of the Ig gene conversion activity 
(Buerstedde et al., 1990). This view is now supported by the finding diat the AID negative 
DT40°^' AID^' clone, which should have lost the Ig gene conversion activity, stably remains 
sIgM(+). Most interestingly, subclones of the AID positive \i/V knock-out clones (AIDVvp^** 

15 and AmVvO rapidly accumulate slgM(-) populations whereas subclones of the ATO negative vV 
knock-out clones (AID^>V»"^ and AID'''v|;\0.remain sIgM(+) (Figs. 2A and 2B). This suggests 
that the deletion of the pseudogcnes dramatically increases the rate of deleterious light chain 
mutations in AID ^pressing cells. 



Replaceinent of Ig gene conversion by hypermutation in the absence of v(fV donors 

To analyze the newly identified mutation activity, the rearranged light chain VJ segments of 
the \|/V knock-out clones were sequenced 5-6 weeks after subcloning. A total of 135 nucleotide 
changes (Fig. 4A, Table 1) were found in the 0.5 kb region between the V leader and tfie 5' end of 
the J-C intron within 95 sequences firom the AID\V clone (Fig. 3, above reference sequence). In 
contrast to the conversion tracts seen in wild-type DT40 cells, ahnost all changes are single base 
substitutions and apart fi-om a few short deletions and di-nucleotide changes, mutation chistere 
were not observed..The lack of conversion events in AIDVV", which still contains the \|/V genes 
of the unreairanged light chain locus, confirms thatig gene conversion only recruits the \|/V genes 
on the same chromosome for die diversification of the rearranged light chain gene (Carlson et al., 
1990). No sequence diversity was foimd in a collection of 95 light chain gene sequences from the 
AID'^'ii/V clone (Fig. 4A, Table 1), indicating that AID is required for the mutation activity. 

Sequences derived fi:om the AID Vv^' clone occasionally display stretches of mutations 
which can be accounted for by the remaining ^fVl and \^Y2 (Fig. 3, below reference sequence). 
Nevertheless, the majority of AID mutations are single untemplated base substitutions as 

seen with the AID cells (Fig. 4A, Table 1). Only 3 base substitutions, which possibly are PGR 
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artifacts, were found in 92 sequences of the AJD'^'xi/V^ clone confirming that both the gene 
conversion and the mutation activities of AID^V*'^ are AID dependent 

The new mutation activity of the knock-out dones closely resembles somatic 
5 hypermutation . 

The discovered Ig mutation activity in the \(rV knock-out clones vri& a predominance of 
single nucleotide substitutions suggests fhat somatic hypermutation had replaced Ig gene 
conversion. There is however a difference between the nucleotide substitutions in the AIDV^^^"^ 
and AIOVV^ clones and Ig hypermutations in germinal center B cells in that the clones show very 

10 few mutations in A/T bases and a preference for transversion mutations (Fig. 4B). 

Ig hypermutations are typically localized within one kb of the transcribed gene sequence 
with preferences for the Complementary Determining Regions (CDRs) of the V(D)J segments, 
whereas no or few mutations are present in ih& downstream C region (Lebeoque and Gea±art, 
1990). To investigate whether tiie mutations in the AIDV^ ^^^o"® follow a similar distribution, 

15 sequence analysis' was extended to the promoter region and the J-C intron of the rearranged light 
chain gene (Fig. 5). Although mutations are found close to the promoter and in the intron 
downstream of the J segments, the peak incidence clearly coincides with the CDRl and CDR3, 
which are also preferred sites of gene conversion m DT40 (uiq)ublished results). Approximately 
half of all point mutations faU within the RGYW (R = A/G; Y = C/T; W = A/I) sequence motif or 

20 its complement WRCY (Fig. 4C), known as hot spots of Ig hypermutation in humans and mice. 

It was previously reported that the deletion of RAD5 1 paralogues induces Ig hypermutation 
in DT40 (Sale et al., 2001). To compare the hypermutation activity in the \)/V gene negative and 
RAD51 paralogue negative backgrounds, the XRCC3 gene was disrupted in the DT40^^ clone 

25 and the rearranged VJ genes were sequenced 6 weeks after subcloning. Similar to the mutation 
spectrum in the AIDV^ clone and what was previously reported (Sale et aL, 2001), flie mutations 
in the sequences fix)m the XRCC3^' cells show a transversion preference and an absence of 
mutations in A/T bases (Fig. 4B). Nevertheless the mutation rate in the XRCC3 mutant was about 
2.5 fold lower than in the AIDVv clone and there was a clear slow growth phenotype of the 

30 XRCC3 mutant compared to wild-type DT40 and the AID clone (Fig. 4D). 

To identify the mutations responsible for the loss of s^gM expression in the AID V^" clone, 
94 light chain cDNAs from sorted sIgM(-) cells were amplified and sequenced. Although one 
short insertion and five deletions were detected in this collection (Table 1), 89% of the 245 total 
35 mutations are single nucleotide substitutions within the VJ segments (Fig. 5). Suiprisingly, only 
about 10% of the sequences contained a stop codon or a firameshift, suggesting that the lack of 
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sIgM(-) expression is mainly caused by amino acid substitutions which affect the pairing of flie Ig 
light and heavy chain proteins. 

Ig locus specificity of hypermutation 

It has been reported that high AID expression in fibroblasts (Yoshikawa et al., 2002) and B 
cell hybridomas (Martin and Scharff, 2002) leads to frequent mutations in transfected transgenes. 
To rule out that the pseudogene deletions had induced a global hypermutator phenotype, the 5' 
ends of the genes encoding the B cell specific marker Bu-1 and the translation elongation factor 
EFla were sequenced for the ADDVv clone. Only a single one bp deletion was found within 95 
sequences of the Bu-1 gene and only two single nucleotide substitutions within 89 sequences of 
EFla (Table 1). As these changes most likely represent PGR artifacts, this further supports the 
view that the hypennutations induced by the v|/V deletions are Ig locus specific. 

Discussion 

The results demonstrate that the deletion of the nearby pseudogene donors abolishes Ig gene 
conversion in DT40 and activates a mutation activity whidi closely resembles Ig hypermutation. 
The features shared between the new activity and somatic hypermutation include 1) AID 
dependence, 2) a predominance of single nucleotide substitutions, 3) distribution of the mutations 
within the 5' transcribed region, 4) a preference for hotspots and 5) Ig gene specificity. The only 
difference with regard to Ig hypermutation in vivo is the relative lack of mutations in A/T bases 
and a predominance of transversion mutations in die \|fV knock-out clones. However, this 
difference is also seen in hypennutating EBV transfonned B cell lines (Bachl and Wabl, 1996; 
FaiH et al, 2002) and DT40 mutants of RAD51-paralogues (Sale et al., 2001) indicating that part 
of the Ig hypennutator activity is missing in transformed B cell lines. Interestingly, the rate of Ig 
hypermutation in the AIDN|/V clone seems higher than the rate of Ig gene conversion in the 
DT40^* progenitor. An explanation for this could be that some conversion tracts are limited to 
stretches of identical donor and target sequences and thus leave no trace. 

The induction of Ig hypermutation by the blockage of Ig gene conversions supports a simple 
model explaining how hypermutation and recombination is initiated and regulated (Fig. 6). At the 
top of the events is a modification of the rearranged V(D) J segment which is either directly or 
indirecdy induced by AID, The default processing of this lesion in the absence of nearby donors or 
in the absence of high homologous recombination activity leads to Ig hypermutation in form of a 
single nucleotide substitution (Fig. 6, right side). However, if donor sequences are available, 
processing of die AID induced lesion can be divided into a stage before strand exchange, when a 
shift to Ig hypermutation is still possible and a stage after strand exchange when the conmiitment 
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toward Ig gene conversion has been made (Fig. 6, left side). Whereas completion of the first stage 
requires the participation of the RAD51 paralogues, the second stage involves other recombination 
factors like the RAD54 protein. 

5 This difference in commitment explains why disnq>tions of the RAD51 paralogues not only 

decrease Ig gene conversion, but also induce Ig hypennutation (Sale et al., 2001) whereas 
disruption of the RAD54 gene only decreases Ig gene conversion (Bezzubova et al., 1997). Hie 
model also predicts that low cellular homologous recombination activity prevents Ig gene 
conversion even in the presence of conversion donors. Such a low homologous recombination 
10 activity might be the reason why human and murine B cells never use Ig gene conversion despite 
the presrace of neaiby candidate donors in form of unrearranged V segments and why chicken 
genninal center B cells shave shifted from Ig gene conversion to Ig hypermutation (Arakawa et al., 
1998). 

1 5 The AID^ and the \j/V knock-out DT40 clones are a powerful experimental system to 

address the role of trans-acting factors and cis-acting regulatory sequences for Ig gene conversion 
and hypermutation. Compared to alt^ative animal or cell culture systems it offers the advantages 
of: 1) parallel analysis of Ig gene conversion and Ig hypermutation, 2) conditional AID expression, 
3) easy genome modifications by gene targeting, 4) normal cell proliferation and repair proficiency 

20 and 5) Ig locus specificity of hypennutation. The ability to induce gene specific hypermutation in 
the DT40 cell line might also find applications in biotechnology. One possibility is to replace the 
chicken antibody coding regions by their human counterparts and then to simulate antibody 
affinity maturation &om a rq>ertoire which continuously evolves by Ig hypermutation. 

25 2. Targeted in vivo mutagenesis of GFP by gene conversion and 
hypermutation 

The gene encoding Green Fluorescent Protein (GFP) is an example of a target nucleic acid which 
can be genetically diversified using the cell system of the invention, in particular the DT40 cell 
30 line. The GFP gene inserted into the Ig light chain locus by targeted integration will be subjected 
to hypermutation and its activity with respect to color, intensity and half-life will evolve with time 
(Fig. 7B). If a combination of hypermutation and gene conversion is used to modify flie GFP 
activities, variant GFP sequences which can serve as gene conversion donors for GFP are also 
inserted into the Ig locus (Fig. 7D). 

35 An Ig VJ replacement vector, pVjRepBsr, v/hich. allows to replace the Ig light chain VJ gene by 
any nucleic acid target is depicted in Fig. 7A. A potential target for mutagenesis can be cloned into 
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Spel site, which is compatible with Xbal, Nhel, Avrll and Spel sites. For example, the GFP gene 
can be inserted into the Ig light chain locus by targeted integration using pVjRepBsr. A \[fV-gene 
donor replacement vector, pPseudoRepBsr, which allows to replace the Ig \|/V gene light chain 
locus by any nucleic acid target is depicted in Fig. 7C. Potential gene conversion donors can be 
5 cloned into either Nhel or Spel site, which is compatible with Xbal, Nhel, Avrll and Spel sites. 
Because Nhel site is located between two loxPs, this site can be used for conditional knockout 
design. By stepwise targeted integration using pPseudoRepGpt and pVjRepBsr, \|/V genes can be 
replaced by \|;GFP gene and its variants (e.g. \|/CFP: cyano fluorescence protein and yYFP: yellow 
fluorescence protein) and the VJ gene can be replaced by GFP carrying a frameshift mutation 
10 (FsGFP) to monitor genetic diversification of the GFP gene. The frameshift in FsGFP is expected 
' to be repaired by gene conversion of \|/GFP, \|/CFP and \|/YFP as templates. In addition, the gene 
will be further diversified by hypennutation. 
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